Heuristics-based Semantic Annotation for Deep Web Query Results
Abstract
Deep Web data is encoded into HTML pages during query processing, so the structural information of the database schema is completely lost. In addition, current query results are designed only for human browsing. To be of greater use, query results must be understandable and processable by machines. Deep Web semantic annotation aims to add correct semantics to query results, which enables computers to understand and process these data. This paper proposes a heuristics-based semantic annotation method. Based on an analysis of the characteristics of interface pages and result pages, it summarizes a set of heuristics. The method applies these heuristics in turn to the data to be annotated in order to identify a semantic vocabulary term for each data unit. Finally, a semantic annotation experiment is performed on Deep Web data from various domains in the UIUC standard dataset. The experimental results indicate that the approach is highly effective and that it performs better than the ontology-based annotation (OBA) method.
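The idea of applying heuristics "in turn" to each data unit can be illustrated with a minimal sketch. The abstract does not list the actual heuristics, so the three rules below (a label-prefix match against interface-page labels, a price pattern, and a year pattern), along with every function name, are hypothetical stand-ins chosen only to show the control flow: each data unit is passed through an ordered list of heuristics, and the first one that yields a label wins.

```python
import re
from typing import Callable, List, Optional, Tuple

# A heuristic maps a data unit (text from a result page) to a semantic label,
# or returns None when it cannot decide. All rules here are hypothetical.
Heuristic = Callable[[str], Optional[str]]


def make_prefix_heuristic(interface_labels: List[str]) -> Heuristic:
    """Assumed rule: a unit such as 'Author: ...' carries an interface-page label."""
    def heuristic(unit: str) -> Optional[str]:
        lowered = unit.strip().lower()
        for label in interface_labels:
            if lowered.startswith(label.lower() + ":"):
                return label
        return None
    return heuristic


def price_heuristic(unit: str) -> Optional[str]:
    """Assumed rule: a dollar amount such as '$12.50' is a price."""
    return "Price" if re.fullmatch(r"\$\d+(\.\d{2})?", unit.strip()) else None


def year_heuristic(unit: str) -> Optional[str]:
    """Assumed rule: a four-digit number starting with 19 or 20 is a year."""
    return "Year" if re.fullmatch(r"(19|20)\d{2}", unit.strip()) else None


def annotate(units: List[str],
             heuristics: List[Heuristic]) -> List[Tuple[str, Optional[str]]]:
    """Try each heuristic in order; keep the first label that is produced."""
    annotated = []
    for unit in units:
        label = None
        for heuristic in heuristics:
            label = heuristic(unit)
            if label is not None:
                break
        annotated.append((unit, label))
    return annotated


if __name__ == "__main__":
    rules = [make_prefix_heuristic(["Title", "Author"]),
             price_heuristic,
             year_heuristic]
    for unit, label in annotate(["Author: J. Smith", "$12.50", "2009", "Hardcover"], rules):
        print(f"{unit!r} -> {label}")
```

Units that no heuristic recognizes stay unlabeled (`None`) rather than receiving a guessed label; how the paper's method handles such cases is not stated in the abstract.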
Related resources
Query expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the growth of the Web, there are many challenges in establishing a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, but the problem ...
Unveiling the hidden bride: deep annotation for mapping and migrating legacy data to the Semantic Web
The success of the Semantic Web crucially depends on the easy creation, integration and use of semantic data. For this purpose, we consider an integration scenario that defies core assumptions of current metadata construction methods. We describe a framework of metadata creation where web pages are generated from a database and the database owner is cooperatively participating in the Semantic W...
Semantification of Query Interfaces to Improve Access to Deep Web Content
This position paper as part of a PhD thesis is a contribution to an automatic retrieval of information from the Deep Web. Addressing current limitations of the Deep Web Information Retrieval leads to the prevailing lack of semantics regarding the retrieval process. Focusing this problem from the information providing services perspective, indicates the significant potential of additional semant...
Review on Automatic Annotation of Query Results from Deep Web Database
In recent years, web database extraction and annotation have received much attention from the database and Information Extraction (IE) research areas due to the volume and quality of the deep web. Many web databases are accessible through an HTML form-based interface. When a query is submitted to the search interface, the query result page is generated. Search Result Records (SRRs) are the result pages obt...